Next Generation Cancer Data Discovery, Access, and Integration Using Prizms and Nanopublications
نویسندگان
چکیده
To encourage data sharing in the life sciences, supporting tools need to minimize effort and maximize incentives. We have created infrastructure that makes it easy to create portals that supports dataset sharing and simplified publishing of the datasets as high quality linked data. We report here on our infrastructure and its use in the creation of a melanoma dataset portal. This portal is based on the Comprehensive Knowledge Archive Network (CKAN) and Prizms, an infrastructure to acquire, integrate, and publish data using Linked Data principles. In addition, we introduce an extension to CKAN that makes it easy for others to cite datasets from within both publications and subsequently-derived datasets using the emerging nanopublication and World Wide Web Consortium provenance standards.
منابع مشابه
Genome Annotation using Nanopublications: An Approach to Interoperability of Genetic Data
With the widespread use of Next Generation Sequencing (NGS) technologies, the primary bottleneck of genetic research has shifted from data production to data analysis. However, annotated datasets produced by different research groups are often in different formats, making genomic comparisons and integration with other datasets challenging and time consuming tasks. Here, we propose a new data in...
متن کاملFinding Novel Associations Across Domains Using Linked Data: a Case Study on Genetic Variants Disrupting Transcription Start Sites
With the widespread use of Next Generation Sequencing technologies, the primary bottleneck of genetic research has shifted from data production to data analysis. However, heterogeneous data sets makes comparisons and integration challenging and time consuming. Here, we apply a data interoperability approach that provides unambiguous (machine readable) description of genomic annotations based on...
متن کاملUsing Nanopublications to Incentivize the Semantic Exposure of Life Science Information
The growing rate of data production in the life sciences creates an urgent need for semantic integration of information. Although the development of tools and infrastructure will make semantic data exposure easier with time, presently the effort associated with creating linked data remains largely unrecognized by peer-review processes, publishers, and promotion committees. Here, we describe a n...
متن کاملNext Generation Sequencing and its Application in the Study of Microbiome in Plant Diseases Suppressive Soils
Progress in next-generation sequencing has played a significant role in ecological studies of microbial populations. These advances have led to a rapid evaluation in metagenomics studies (analysis of DNA of microbial communities without the need to culture). Many statistical and computational tools and metagenomics databases have led to the discovery of huge amounts of data. In this research, i...
متن کاملPublishing DisGeNET as nanopublications
The increasing and unprecedented publication rate in the biomedical field is a major bottleneck for knowledge discovery in the Life Sciences. The manual curation of facts from published scientific papers is slow and inefficient, and therefore new approaches are needed that can enable the automatic, scalable and reliable extraction of assertions. While the publication of scientific assertions an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Data integration in the life sciences : ... International Workshop, DILS ... : proceedings. DILS
دوره 7970 شماره
صفحات -
تاریخ انتشار 2013